Initialization, Parameters Extraction and Evaluation

نویسنده

  • Zhengrong Yao
چکیده

This thesis covers topics relevant to model-based coding. Model-based coding is a promising very low bit rate video coding technique. The idea behind model-based coding is to parameterize a talking head and to extract and transmit the parameters describing facial movements. At the receiver, the parameters are used to reconstruct the talking head. Since only high-level animation parameters are transmitted, very high compression can be achieved with this coding scheme. In this thesis, we focus on problems related to initialization, parameters extraction and performance evaluation of face tracking system. The thesis consists of mainly the following aspects as numbered: The initialization problem of model based coding is to fit a generic face model onto a target face in the first video frame. It is the first step toward subsequent motion estimation. 1) We proposed a pseudo-automatic initialization scheme, a Analysis-by-Synthesis based on Simulated Annealing is adopt for the initialization task and proved to be efficient. For parameters tracking, 2) A new scheme of performing texture mapping and motion estimation is suggested which use sample based direct texture mapping. 3) The feasibility of using active motion estimation is explored in Model-based coding which proved to be able to give more than 10 times tracking resolution for rotation. 4) Dynamic Programming is used to solve the “loss of tracking” problem in 2D face tracking. An important problem in Model-based coding is the evaluation of motion parameters. 5) We studied the evaluation problems with the commonly used method which employ a magnetic sensor as “ground truth” and pointed out the problems with such method and the importance of solving the performance evaluation problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluating the Initialization Methods of Wavelet Networks for Hyperspectral Image Classification

The idea of using artificial neural network has been proven useful for hyperspectral image classification. However, the high dimensionality of hyperspectral images usually leads to the failure of constructing an effective neural network classifier. To improve the performance of neural network classifier, wavelet-based feature extraction algorithms can be applied to extract useful features for h...

متن کامل

Automatic snakes for robust lip boundaries extraction

Active contours or snakes are widely used in object segmentation for their ability to integrate features extraction and pixel candidate linking in a single energy minimizing process. But the sensitivity to parameters values and initialization is also a widely known problem. Performance of snakes can be enhanced by better initialization close to the desired solution. We present here a ne mouth r...

متن کامل

A new nonlinear feature extraction algorithm for speaker verification

In this paper we propose a new feature extraction algorithm based on nonlinear prediction: the Neural Predictive Coding model which is an extension of the classical LPC one. This model is applied to speaker verification by the Arithmetic-Harmonic Sphericity (AHS) method. Two different initialization methods are proposed for the coding method based on the Neural Predictive Coding (NPC): classica...

متن کامل

Towards Robust Lip Tracking

An algorithm for the automatic extraction of speakers lips in video sequences is presented here. Our goal is to extract minimum face feature parameters, vital for audio-visual communication, in adverse conditions and at a very low bit rate coding. Our method uses spatial (region and contour) and temporal (similarity function) information from Luminance and Hue components. A new literal inverse ...

متن کامل

A Combined Approach to Feature Extraction for Mouth Characterization and Tracking

A statistical Active Appearance Model (AAM) is developed to model and track the lips. A major challenge in modelling the mouth area, considered the most deformable facial feature, is the weak contrast between lips and skin colour. In order to perform accurately, AAM requires a strong initialization point. In our approach, key information is firstly extracted using chrominance analysis. Then opt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004